Contrastive Decoding


// recent coverage 1 mentions

22:10

2025-12-10

rosmine.ai

large-language-models

A new test for if your LLM is subtly manipulating you

A method that uses Contrastive Decoding to detect when a large language model (LLM) is subtly suppressing information, for example by avoiding mentions of a competitor's product. The author trained a "Manipulato…

// co-occurs with top 1 entities

PyTorch 1